REMARKS 

Favorable reconsideration is respectfully requested in view of the foregoing 
amendments and the following remarks. 

Pending Claims 

Prior to this Amendment, Claims 1-17, 27, 30-34 and 40-43 were pending. All the 
pending claims have been cancelled and replaced by Claims 44 - 88. 

Applicant reserves the right to file a divisional application with any of the claims 
cancelled herein. 

Overview of claims 

There are now 5 independent claims: 44, 47, 54, 58 and 68. 

Claim 44 covers synthetic proteins that comprise retroinverted peptides with 
specified sequences. 

Claim 47 covers synthetic proteins that comprise either retroinverted peptides with 
specified sequences or fragments of those retroinverted peptides. 

Claim 54 covers synthetic proteins that comprise either retroinverted peptides with 
specified sequences or homologs of those retroinverted peptides. 

Claim 58 covers synthetic proteins that comprise either retroinverted peptides with 
specified sequences, fragments of those retroinverted peptides, or homologs of those 
fragments. 

Claim 68 covers synthetic proteins, up to 50 amino acids in length, that comprise 
retroinverted peptides with specified sequences. The specified sequences are shorter than 
the specified sequences in claims 47, 54, 58, and 58. 
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Incorporation by reference 

Consistent with the guidelines in MPEP §608(p), Applicants are adding material 
from WO 98/51325 to the specification. WO 98/51325 is a published application that was 
incorporated by reference into the present application as can be seen from the following 2 
excerpts from the present application: 

. Previously, as disclosed and claimed in WO 98/51325, which is hereby 
incorporated by reference in its entirety, we have identified random peptides and 
their fragments, motifs, derivatives or peptidomimetics thereof which are capable 
of binding to GIT receptors such as the D2H, hSI, HPT1 and hPEPTI receptors 
(hereinafter "GIT targeting peptides"). (From page 3, lines 7-11). 

The present invention relates to retro-inverted peptides (also referred to 
herein as "targeting retro-inverted peptides" or "targeting retro-inversion peptides") 
that target specific receptor sites in vivo and/or promote uptake of active agents 
and/or enhance active agent delivery across the GIT into the systemic, portal or 
hepatic circulation. In particular, these retro-inverted peptides specifically bind to 
one or more of the human gastrointestinal tract receptors HPT1, HPEPTI, D2H, or 
hSI or their equivalents in other mammals and have general utility in targeting active 
agents to selected sites and/or selected tissues in the body in which receptors are 
expressed. These peptides are synthesized from D-amino acids and have a 
reverse sequence order of the GIT targeting agents disclosed and claimed in the 
above-referenced WO 98/51325. (From page 4, line 20 to page 5, Iine12). 
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Material incorporated by reference from WO 98/51325 is summarized in the 
following table 



Material 


Location in 
WO 98/51325 


Insert position in 
specification of 
present 
application 


Claims in which 
Material 
appears in 
application 


Information on GIT receptors 


page 45 line 25 to 
page 46, line 37 


Page 5, after line 
11 


None 


Sequences of 55 receptor-binding peptides 
identified from a phage library (SEQ ID 
NOS: 16-70) 


page 54, lines 5 
to page 55, lines 
37 


Immediately 
following above 
insert 


44, 47,54.58 


Sequences of 13 binding motifs (SEQ ID 
NOS: 71-83) 


Claims 6, 10, 14, 
18-20 


Sequence Listing 


68 


Sequences of 4 GIT receptors (SEQ ID 
NOS: 84-87) 




Sequence Listing 


None 


Reference to 80 or 90 percent homology; 


page 21, line 36 
to page 22, line 
16 


Page 6, after line 
14 


54,56,58,66 


fragment length is at least 5, 10 or 20 
amino acids 


page 21, line 36 
to page 22, line 
16 


Page 6, after line 
14 


47-49,58-60 


protein length is not more than 75 amino 
acids 


page 21, line 36 
to page 22, line 
16 1 


Page 6, after line 
14 


45,52,63 
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Changes made in text incorporated by reference 

Applicants have incorporated text from page 54, line 5 to page 55, line 55 of WO 
98/51325. The text corresponds to Table 7 of WO 98/51325 plus the paragraph that 
precedes it. Regarding that text, Applicants have made the following changes: 

1) Added an introductory phrase to the paragraph preceding the Table: -- As 
indicated in WO 98/51325-- 

2) Added a sentence after the paragraph preceding the Table: - Their insert 
sequences are summarized as follows: ~ 

3) Deleted the header "Table 7" 

4) Moved the title of the table, " TARGET BINDING PHAGE INSERT SEQUENCES" 
to become the header to the right column : -TARGET BINDING PHAGE INSERT 
SEQUENCE- 

5) Changed the SEQ ID Nos from 1-55 to 16-70. 

Support for Amendments 

The following examples of support for any given claim limitation are intended to be 
illustrative, not exhaustive. ^'^"^ 

Support for newly added amino acid sequences 

The SEQ ID NOs of newly added sequences incorporated by reference from WO 
98/51325 are presented in the following Table together with their corresponding SEQ ID 
NOs from WO 98/51325. 
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Qpn in MHc 

OCVJ lU INvJS 

in nrocont 
If 1 pi cocl 11 

pnnlip?itinn 

ClfJfJIfoCI UL/I 1 


in 
II 1 


Naiure ot 

ncintiHo/nrr»toin 


' 16-70 


1-55 


Taraptina aoent*^ 


71-83 


253-265 


Targeting agents 


84 


176 


hPEPTI receptor 


85 


178 


HPT1 receptor 


86 


179 


hSI receptor 


87 


181 


D2H receptor 



Support for "specifically binds to a Caco-2 cell membrane fraction" 



That phrase appears in the 5 newly added independent claims, 44, 47^54, 58, and 
68. The use of the Caco-2 assay to obtain data is described at pages 19-21 . Regarding 
the Caco-2 assay, generally, as a test for the functionality of fragments and homologs, the 
following from the present application is noted: 

The present invention also relates to derivatives (including but not limited to 
fragments) of these retroinverted peptides, which derivatives are functionally similar 
to the retro-invert peptides (that is, capable of displaying one or more known 
functional activities of the retro-inverted peptides). These functional activities include 
but are not limited to the ability to bind or to compete with binding to the gastro- 
intestinal tract receptors HPT1 , HPEPTI , D2H or hSI or the ability to be bound by 
an antibody directed against the retro-inverted peptide . Derivatives can be tested 
for the desired activity bv procedures known in the art, including binding to a 
receptor domain or to Caco-2 cells, in vitro, or to intestinal tissue, in vitro or in vivo. 
(See page 5, lines 3-12, of the present application; underlining added here) 



Support for the limitation that the synthetic protein does not exceed 75 amino acids in 
length 

Support is found in material incorporated by reference from PCT application, page 
21, line 36- page 22, line 5. 
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Support for the limitation that the synthetic protein does not exceed 50 amino acids in 
length 

Support is found in Claim 4 of the present application as filed. 

Support for the limitation that the fragments of specified retroinverted peptides are at least 
5, 10 or 20 amino acids in length 



Support is found in the material incorporated by reference from the^CT application, 
page 21 , line 36 - page 22, line 4. ' 

Support for the limitation that the homologs of specified retroinverted peptides show not 
more than 80 or 90 percent homology (but less than 100%) 

Support for 80% and 90% is found in the material incorporated by reference from 



has less than 100% homology. 

Support for the limitation that the homologs of specified retroinverted peptides meet one of 
four tests based of amino acid functional eguivalency 

This claim limitation, including the specification of 4 types of amino acid functional 
equivalency, finds support in the present application as filed, page 5, lines 24-29. 

Support for claims which cover glycosylation, acetylation, phosphorylation, and amidation 
Such claims find support in the present application as filed, page 5, line 30 to page 
6, line 1. 

Support for synthetic proteins with an added dansyl-lysine group 

Such dansylated derivatives are made routinely for purposes of the CaCo-2 binding 
assay. (See pages 19-21 of the present application as filed). 





^e PCT applicationv.page 21, line 36 - page 22, line 11. Also, a "homolog", by definition, 
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Support for claims involving nanoparticles or microparticles. also size range 

See claims 40-42 and pages 22-25 of the application as filed. As to particle sizes 
between 10 nm and 500 |am, see page 22, lines 5-8, 

Support for drug classes and specific drugs covered in the Claims 
See the application as filed, page 7, line 29 to page 9, line 1 . 

Support for the drug being insulin or leuprolide in the claims 
See,the application as filed, claim 43 and pages 25-26. 

Support for modifications to Table 1 of the present application 

A number of changes have been made for clarity and consistency: 

A column specifying the SEQ ID NO has been added at the left of the Table. 

The K(dns) group has been eliminated from the sequence in rows 1 through 6. As a 

result, the sequences in rows 1-6 of the table now precisely reflect the sequences in the 

Sequence Listing previously submitted in this case for SEQ ID NOS: 1-6. 

SEQ ID NOs 1-6, with their additonal K(dns) moieties, are now in rows 7-12 of Table 

1 . The K(dns) moiety is a dansyl-lysine mnoiety added to various peptides to make them 

detectable in the binding assays. 

Modifications to Table 3 of the present application 

Consistent with the amendments to Table 1, Table 3 has also been amended 
compared to the version submitted in the Amendment of October 5, 2001 . The 
amendments are as follows: 

Row 2, ZEIan129, the SEQ ID NO: has been changed from 4 to 12. 

Row 3, ZEIan144, the SEQ ID NO: has been changed from 1 to 9. 
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Row 5, ZEIan091, the SEQ ID NO: has been changed from 6 to 14. 
Row 6, ZEIan146, the SEQ ID NO: has been changed from 3 to 11 . 

Appendix to this Amendment 

Applicants have attached an Applendix with copies of those pages from the WO 
98/51325 that have the material that was incorporated via the present Amendment into the 
present application. 

Sequence Listing 

It is expected by the undersigned that an "AMENDMENT with Revised Sequence 
Listing" will be hand-delivered today to Group 1600 for Examiner Hope Robinson. 

Response to rejections in Office Action of January 2, 2002. 

Rejection of Claims 1-17, 27, 30-34 and 40-43 under 35 U.S.C. 112, first paragraph 
(Paragraph 2 of the Office Action) 

The Examiner has rejected Claims 1-17, 27, 30-34 and 40-43 under 35 U.S.C. 
§112, first paragraph, stating that while being enabling for the retro-inverted peptides and 
the specific sequences (SEQ ID NOs: 1-3), the specification does not reasonably provide 
enablement for derivatives or fragments thereof or a binding portion thereof or a 
composition for treatment of any mammalian disease or disorder. This rejection is 
respectfully traversed for the reasons that follow. (Although the rejected claims having 
been replaced by the present Amendment, Applicants will respond to the rejection as if it 
was directed at each of the 5 independent claims now in the case.) 

Claim 44 covers synthetic proteins that comprise retroinverted peptides with 

specified sequences. Data in the application shows examples where receptor binding 

ability is retained when an L-form peptide is "converted" to the retroinverted form. 
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Claim 47 covers synthetic proteins that comprise either retroinverted peptides with 
specified sequences or fragments of those retroinverted peptides. To the extent that some 
such fragments do not retain binding ability as specified in the claim, such fragments are 
not covered by the claim. To determine which fragments of a retroinverted peptide will 
retain that peptide's ability to bind in the Caco-2 binding assay, it is only necessary to 
identify the minimum "core region" needed for such binding. This can be done by 
systematically testing smaller and smaller fragments of a peptide for binding ability. In one 
approach, one successively eliminates 3-amino acid sections from each end of the 40-mer 
until binding ability is lost. If, for example, the core fragment is a 10-mer positioned at the 
center of the 40-mer, then the deletion of a 3-mer, 6-mer, 9-mer, 12-mer, and 15-mer from 
either end (10 tests total) would not eliminate the binding ability. Deletions of an 18-mer 
from either end would eliminate it. To achieve finer resolution, deletions of 16-mers and 17- 
mers could be tested. In any case, a total of only about 16 tests would be sufficient to 
identify the core binding peptide. 

Claim 54 covers synthetic proteins that comprise either retroinverted peptides with 
specified sequences or homologs of those retroinverted peptides. As "homologs" implies 
similarity, the claim will tend to cover structures that retain binding ability. To the extent 
that some homologs do not retain the ability to bind as specified in the claim, such 
homologs are not covered by the claim. 

Claim 58 covers synthetic proteins that comprise either retroinverted peptides with 
specified sequences, fragments of those retroinverted peptides, or homologs of those 
fragments. The fragments that retain specific binding activity can be determined in a 
reasonable number of steps as outlined above. As "homologs" implies similarity, the claim 
will tend to cover structures that retain binding ability. Fragment homologs that do not 
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retain the ability to bind are not covered by the clainn. 

Claim 68 covers synthetic proteins that connprise retroinverted peptides with 
specified sequences. Data in the application shows examples where receptor binding 
ability is retained when an L-form peptide is "converted" to the retroinverted form. 

Applicants submit that the foregoing is responsive to the issues raised by the 
Examiner as to: 

I. Quantitation of Experimentation; 

II. Amount of direction or guidance presented; 

IV. Nature of the invention; 

V. State of the prior art and relative skill of those in the art; and 
VL Predictability or unpredictability of the art. 

where the Roman numerals for each issue are those used by the Examiner. 
The Examiner also raised issues III and VII as follows: 

III. Presence or absence of working examples. 

Applicants have included an example showing that orally delivered insulin-loaded 
nanoparticles coated with the retroinverted 15-mer peptide, ZEIan144 produce as good or 
better bioavailability of insulin as such particles coated with ZEIan 129, the L-peptide 
counterpart of ZEIan 144 (Figure 2 and Table 5). The ZEIan 144-coated insulin-loaded 
nanoparticles also showed a therapeutic effect, evidenced by the reduction of glucose 
levels (Figure 1). 

The retroinverted peptide ZEIan 146 provided measureable bioavailability, about 
20% that provided by ZEIan 144. 

Applicants submit that it is reasonable to extrapolate their success with ZEIan 144 
and ZEIan 146 to the retroinverted forms of other peptides that are receptor binders. 
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Vll. Breadth of the claims. 
The Examiner has stated that the claims encompass any disease/disorder In response, 
Applicants have amended the claims so that they are more specific as to the types of 
active agents envisioned. Applicants submit that, by providing more specificity as to what 
consititutes an active agent, Applicants inherently describe a corresponding disorder or 
disease known in the art to be treatable by that agent. 

The Examiner has also stated that the claims cover any derivative/fragment or 
portion thereof. The claims presently in the case only cover those derivatives/fragments 
that show specific binding. 

Rejection of Claims 1-17, 27, 30-34 and 40-43 under 35 U.S.C. 112, second paragraph 
(Paragraph 3 of the Office Action) 

The Examiner has rejected Claims 1-17, 27, 30-34 and 40-43 under 35 U.S.C. 

§112, second paragraph, as being indefinite as follows: 

1) Claim 1 and dependent claims are rejected on the grounds that the recitation of 
"HPT1, hPEPTI, D2H and hSI" is insufficiently definite. Applicants no longer use these 
terms in the claims. 

2) As to all the rejected claims, the Examiner has suggested using the qualifier 
"synthetic" or "isolated". Applicants use "synthetic" in the new claims. 

3) Claims 2 and 13 are rejected on the grounds that "binding portion" is unclear. In 
the new claims, that term is not used. 

4) Claims 4-7 are rejected on the grounds that they lack antecedent basis and 
suggests that Claim 1 be amended to recite specific sequences. The independent claims 
that have replaced Claim 1 recite specific sequences. 

5) Claim 8 is rejected on the grounds that the meaning of the word "material" is 
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unclear. Although Claim 8 has been cancelled, the word "material" appears in new claims 
similar to Claim 8. In those claims (as in Claim 8), "Material" is intended to refer to any 
material that comprises the active agents specified in the claim, consistent with a major 
purpose of the invention - to be able to direct agent-loaded compositions to the GIT 
receptors. 

6) Claims 8 and 13 are rejected on the grounds that no specific disease or disorder 
is described. As noted above, the new claims specify classes of drugs, and the drugs imply 
specific diseases. 

7) Claim 16 is rejected on the grounds that it is unclear how the composition 
"facilitates" transport of the active agent. The word "facilitates" is not in the new claims. 

8) Claim 30 is rejected on the grounds that that there is no antecedent basis for 
"one or more functional activities of said peptide". The phrase does not appear in the new 
claims. 



In view of the foregoing remarks, it is respectfully submitted that all of the 



claims now pending in this application are allowable. 



July 2002 



Respectfully submitted, 
CAESAR, RIVISE, BERNSTEIN, 
COHEN & POKOTILOW, LTD. 




Allan H. Fried 

Reg. No. 31,253 

Seven Penn Center, 12th Floor 

1635 Market Street 

Philadelphia, PA 19103-2212 

(215) 567-2010 

Attorneys for Applicants 
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AMENDMENTS WITH MARKINGS SHOWING CHANGES 



IN THE SPECIFICATION 

Table 1, page 19, already amended on October 5, 2001, is further amended as 
follows: 



SEQ 

ID 

NO: 


Name 


Description 


Sequence 


1 


SEQ ID N0:1 
[Zelan 144] 


PAX2 15 mer fragment-D 
form retroinversion 


rtrlrrnhsshkant 
[K(dns)-rtrlrrnhsshkant] 


2 


SEQ ID N0:2 
[Zelan 145] 


P31 16 mer fragment- D 
form retroinversion 


gphrrgrpnsrsskrt 
[K(dns)- gphrrgrpnsrsskr] 


3 


SEQ ID N0:3 
[Zelan 1146] 


HAX42 14 mer fragment- D 
form retroinversion 


atsnanaccnvdoD 
[K(dns)- gtsngngccnydgp ] 


4 


SEQ ID N0:4 


PAX2 15 mer fragment 


TNAKHSSHNRRLRTR 
[K(dns)- 

TNAKHSSHNRRLRTR] 


5 


SEQ ID NO. 5 
[Zelan 031] 


P31 16 mer fragment 


TRKSSRSNPRGRRHPG 
[K(dns)- 

TRKSSRSNPRGRRHPG] 


6 


SEQ ID N0:6 
[Zelan 091] 


HAX42 14 mer fragment 


PGDYNCCGNGNSTG 
[K(dns)- 

PGDYNCCGNGNSTG ] 


9 


ZEIan144 


dansvlated 

PAX2 15 merfraqment-D 
form retroinversion 


K(dns)-rtrlrrnhsshkant 


10 


ZEIan145 


dansvlated 

P31 16 mer fragment- D 
form retroinversion 


K(dns)-qphrrqrpnsrsskrt 


11 


ZEIan146 


dansvlated 

HAX42 14 mer fragment- D 
form retroinversion 


K(dns)-atsngnaccnvdaD 


12 


ZEIan129 


dansvlated 

PAX2 15 mer fragment 


K(dns)- 

TNAKHSSHNRRLRTR 
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SEQ 

ID 

NO: 


Name 


Description 


Sequence 


13 


ZEIan031 


dansvlated 

P31 16 mer fraament 


K(dns)- 

TRKSSRSNPRGRRHPG 


14 


ZEIan091 


dansvlated 

HAX42 14 mer fraament 


K(dns)- 

PGDYNCCGNGNSTG 



Table 3, page 21, already amended on October 5, 2001, is further amended as follows: 



Name 


Sequence 


Kd (mitioI) 


ZEIan018 


K(clns)-STPPSREAYSRPYSVDSDSDTNAKHSSHNRRLRTRSRPNG (SEQ ID 
N0:7) 


>50.0 


ZEIan129 


K(dns)-TNAKHSSHNRRLRTR (SEQ ID fN0:41 NO: 12) 


29.6 


ZEIan144 


K(dns)-rtrlrrnhsshkant (SEQ ID [N0:1] N0:9) 


28.8 


ZEIan021 


K(dns)-SDHALGTNLRSDNAKEPGDYNCCGNGNSTGRKVFNRRRPSAIPT 
(SEQIDN0:8) 


6.7 


ZEIan091 


K(dns)-PGDYNCCGNGNSTG (SEQ ID [N0:6] N0:14) 


0.75 


ZEIan146 


K(dns)-gtsngngccnydgp (SEQ ID [N0:3] NO: 11) 


21.65 
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Please replace the paragraph at page 20, line 22 to page 21, line 2, already 
amended on October 5, 2001, with the following paragraph: 



SDHALGTNLRSDNAKEPGDYNCCGNGNSTGRKVFNRRRPSAIPT] ( SEQ ID NO:53: 
dansvlated version is SEQ ID N0:8) was given the arbitrary value of 1.00 for binding to 
P100 at a given peptide concentration determined from the signal-to-noise ratio data. Table 
2 shows the results of PI 00 assays with the HAX42 related peptides ZEIan021, Zelan091 
and ZEIan146. Assay number 1 was at 20 |ig/ml; 2 and 3 were at 50 |xg/ml; and 4 through 
8 were at 25 |xg/ml. The results for the retro-inverted form, Zelan 146 show reasonable 
binding compared to the HAX42 fragment Zelan091 and that the activity of the GIT 
targeting agent was not eliminated when converted to its retro-inverted form. ~ 

Please replace the paragraph at page 21, lines 5-11, already amended on October 
5, 2001, with the following paragraph: 

~Kd values, or the concentration of the peptide required to reach half maximal binding to 
Caco-2 PI 00 fractions, are given in Table 3 for ZEIan021, full length HAX42, [K(dns)- 
SDHALGTNLRSDNAKEPGDYNCCGNGNSTGRKVFNRRRPSAIPT] ( SEQ ID NO:53: 
dansvlated version is SEQ ID N0:8), HAX42 fragment ZEIan091, and the retro-inverted 
form of this fragment, ZEIan146 as well as for ZEIan018, full length PAX2, [K(dns)- 
STPPSREAYSRPYSVDS DSDTNAKHSSHNRRLRTRSRPNG] (SEQ ID N0:7 : dansvlated 
version is SEQ ID NO: 15) , PAX2 fragment ZEIan129, and the retro-inverted form of this 
fragment, ZELan144.~ 



ZEIan021, 



full 



length 



HAX42 



[K(dns)- 
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Appendix with pages from WO 98/512325 



The following pages are attached: 
21-22 
45-46 
54-55 
179-180 
184-189 
192-194 
234-237 



Material incorporated by reference into the present application is marked by a 
vertical black line in the right margins. 
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known in the art, including binding to a GIT transport 
receptor domain or to Caco-2 cells, in vitro, or to 
intestinal tissue, in vivo. {See the Examples infra.) 

In particular, derivatives can be made by altering 

5 GIT transport receptor-binding peptide sequences by 
substitutions, additions or deletions that provide for 
functionally equivalent molecules. Due to the degeneracy of 
nucleotide coding sequences, other nucleotide sequences which 
encode substantially the same amino acid sequence may be used 

10 in the practice of the present invention. These include but 
are not limited to nucleotide sequences which are altered by 
the substitution of different codons that encode a 
functionally equivalent amino acid residue within the 
sequence, thus producing a silent change. Likewise, the GIT 

15 transport receptor-binding peptide derivatives of the 

invention include, but are not limited to, those containing, 
as a primary amino acid sequence, all or part of the amino 
acid sequence of a GIT transport receptor-binding peptide 
including altered sequences in which functionally equivalent 

20 amino acid residues are substituted for residues within the 
sequence resulting in a silent change. For example, one or 
, more amino acid residues within the sequence can be 
substituted by another amino acid of a similar polarity which 
acts as a functional equivalent, resulting in a silent 

25 alteration. Substitutes for an amino acid within the 

sequence may be selected from other members of the class to 
which the amino acid belongs. For example, the nonpolar 
(hydrophobic) amino acids include alanine, leucine, 
isoleucine, valine, proline, phenylalanine, tryptophan and 

30 methionine. The polar neutral amino acids include glycine, 
serine, threonine, cysteine, tyrosine, asparagine, and 
glutamine. The positively charged (basic) amino acids 
include arginine, lysine and histidine. The negatively 
charged (acidic) amino acids include aspartic acid and 

35 glutamic acid. 

In a specific embodiment o£ the invention, proteins 
consisting of or, alternatively, comprising all or a fragment 
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of a GIT transport receptor-binding peptide consisting of at 
least 5, 10, 15, 20, 25, 30 or 35 (contiguous) amino acids of 
the full-length GIT transport receptor-binding peptide are 
provided. In a specific embodiment, such proteins are not 
5 more than 20, 30, 40, 50, or 75 amino acids in length. 
Derivatives or analogs of GIT transport receptor-binding 
peptides include but are not limited to those molecules 
comprising regions that are substantially homologous to GIT 
transport receptor-binding peptides or fragments thereof 

10 (e.g., at least 50%, 60%, 70%, 80% or 90% identity) (e.g., 
over an identical size sequence or when compared to an 
aligned sequence in which the alignment is done by a computer 
homology program known in the art) or whose encoding nucleic 
acid is capable of hybridizing to a coding GIT transport 

15 receptor-binding peptide sequence, under stringent, 
moderately stringent, or nonstringent conditions. 



receptor-binding derivatives of the invention are not known 
proteins with homology to the GIT transport receptor-binding 
20 peptides of the invention or portions thereof. 



derivatives and analogs of the invention can be produced by 
various methods known in the art. The manipulations which 
result in their production can occur at the gene or protein 

25 level. For example, the cloned GIT transport receptor- 
binding peptide gene sequence can be modified by any of 
numerous strategies known in the art (Maniatis, T. , 1990, 
Molecular Cloning, A Laboratory Manual, 2d ed.. Cold Spring 
Harbor Laboratory, Cold Spring Harbor, New York) . The 

30 sequence can be cleaved at appropriate sites with restriction 
endonuclease (s) , followed by further enzymatic modification 
if desired, isolated, and ligated in vitro. In the 
production of the gene encoding a derivative or analog of GIT 
transport receptor-binding peptides, care should be taken to 

35 ensure that the modified gene remains within the same 

translational reading frame uninterrupted by translational 



In a specific embodiment, the GIT transport 



The GIT transport receptor-binding peptide 
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form, together with a suitable amount of carrier so as to 
provide the form for proper administration to the patient. 

The Therapeutics of the invention can be formulated 
as neutral or salt forms. Pharmaceutically acceptable salts 
5 include those formed with free amino groups such as those 
derived from hydrochloric, phosphoric, acetic, oxalic, 
tartaric acids, etc., and those formed with free carboxyl 
groups such as those derived from sodium, potassium, 
ammonium, calcium, ferric hydroxides, isopropylamine, 
10 triethylamine, 2-ethylamino ethanol, histidine, procaine, 
etc . 

The amount of the Therapeutic of the invention 
which will be effective in the treatment of a particular 
disorder or condition will depend on the nature of the 

15 disorder or condition, and can be determined by standard 
clinical techniques. In addition, in vitro assays may 
optionally be employed to help identify optimal dosage 
ranges. The precise dose to be employed in the formulation 
will also depend on the route of administration, and the 

20 seriousness of the disease or disorder, and should be decided 
according to the judgment of the practitioner and each 
patient's circumstances. 

6. EXAMPLES 
25 6.1. Selection of GIT Receptor Targets 

The HPTl, hPEPTl, D2H, and hSI receptors were 
selected for cloning as GIT receptor targets based on several 
criteria, including: (1) expression on surface of epithelial 
cells in gastro-intestinal tract (GIT) ; (2) expression along 

30 the length of small intestine (HPTl, hPEPTI, D2H) ; 

(3) expression locally at high concentration (hSI) ; (4) large 
putative extracellular domains facing into the lumen of the 
GIT; and (5) extracellular domains that permit easy access 
and bioadhesion by targeting particles. 

35 The four recombinant receptor sites screened with 

the peptide libraries additionally have the following 
characteristics : 
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Receptor Characteristics 

D2H Transport of neutral/basic amino acids; a 

transport activating protein for a range of 
amino acid translocases 

5 hSI Metabolism of sucrose and other sugars; 

represents 9% of brush border membrane 
protein in Jejunum 

HPTl di/tri peptide transporter or facilitator of 

peptide transport 

hPEPTl di/tri peptide transporter 

Figures 1-4 (SEQ ID NOS:176, 178, 179, and 181, respectively) 
show the predicted amino acid sequences for hPEPTI, HPTl, hSI 
and D2H, respectively. 



15 



20 



6.2, Cloning of Extracellular Domain of 
Selected Receptor Site 

The following receptor domains were cloned and 

expressed as His-tag fusion proteins by standard techniques: 

Domain (amino 
Receptor acid residues) 

hPEPTl^ 391-571 

HPTl^ 29-273 

hSI^ 272-667 

D2H'* 387-685 



25 * Liang et al . , 1995, J. Biol. Chem. 270:6456-6463 

^ Dantzig et al . , 1994, Association of Intestinal 
Peptide Transport with a Protein Related to the 
Cadherin Super family 

Chantret et al . , Biochem. J. 285:915-923 
^ Bertran et al . , J. Biol. Chem. 268:14842-14949 



30 



The receptor proteins were expressed as His-tag 
fusion proteins and affinity purified under denaturing 
conditions, using urea or guanidine HCl, utilizing the pET 
His-tag metal chelate affinity for Ni-NTA Agarose (Hochuli, 
E., Purification of recombinant proteins with metal chelate 
adsorbent. Genetic Engineering, Principals and Methods (J.K. 
Setlow, ed.), Plenum Press, NY, Vol. 12 (1990), pp. 87-98). 
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plates were treated with PBS containing 0.1% phenyl hydrazine 
for one hour at 37 °C followed by two PBS washes and blocking 
for One hour with 0.5%BSA-PBS. The standard ELISA procedure 
was followed at this point. 
5 Phage which showed specificity to a GIT receptor 

was further characterized by ELISA on a variety of 
recombinant proteins. Phage which continued to exhibit GIT 
receptor specificity was sequenced. 

10 Table 7 

TARGET BINDING PHAGE INSERT SEQUENCES ; 
SEQ. 



15 



20 



30 



35 



hSI 
S15 


ID. NO. 

1 


RSGAYESPDGRGGRSYVGGGGGCGNIGRKHNLWGLRTASPACWD 


S21 


2 


S PRS FWPWSRHE SFG I SNYLGCGYRTC I SGTMTKS S P I YPRH S 


S22 


3 


SSSSDWGGVPGKWRERFKGRGCGISITSVLTGKPNPCPEPKAA 


SNilO 


4 


RVGQCTDSDVRRPWARSCAHQGCGAGTRNSHGCITRPLRQASAH 


SNi28 


5 


SHSGGMNRAYGDVFRELRDRWNATSHHTRPTPQLPRGPN 


SNi34 


6 


SPCGGSWGRFMQGGLFGGRTDGCGAHRNRTSASLEPPSSDY 


SNi38 


7 


RGAADQRRGWSENLGLPRVGWDAIAHNSYTFTSRRPRPP 


SNi45 


8 


SGGEVSSWGRVNDLCARVSWTGCGTARSARTDNKGFLPKHSSLR 


SNiAX2 


9 


SDSDGDHYGLRGGVRCSLRDRGCGLALSTVHAGPPSFYPKLSSP 


SNiAX4 


10 


RSLGNYGVTGTVDVTVLPMPGHANHLGVSSASSSDPPRR 


SNiAXG 


11 


RTTTAKGCLLGSFGVLSGCSFTPTSPPPHLGYPPHSVN 


SNiAXe 


12 


SPKLSSVGVMTKVTELPTEGPNAISIPISATLGPRNPLR 


D2H 






DAB3 


13 


RWCGAELCNSVTKKFRPGWRDHANPSTHHRTPPPSQSSP 


DAB7 


14 


RWCGADDPCGASRWRGGNSLFGCGLRCSAAQSTPSGRIHSTSTS 


DABIO 


15 


SKSGEGGDSSRGETGWARVRSHAMTAGRFRWYNQLPSDR 


DAB18 


16 


RSSANNCEWKSDWMRRACIARYANSSGPARAVDTKAAP 


DAB24 


17 


SKWSWSSRWGSPQDKVEKTRAGCGGSPSSTNCHPYTFAPPPQAG 


DAB30 


18 


SGFWEFSRGLWDGENRKSVRSGCGFRGSSAQGPCPVTPATIDKH 


DAX15 


19 


SESGRCRSVSRWMTTWQTQKGGCGSNVSRGSPLDPSHQTGHATT 


DAX23 


20 


REWRFAGPPLDLWAGPSLPSFNASSHPRALRTYWSQRPR 


DAX24 


21 


RMEDIKNSGWRDSCRWGDLRPGCGSRQWYPSNMRSSRDYPAGGH 


DAX27 


22 


SHPWYRHWNHGDFSGSGQSRHTPPESPHPGRPNATI 
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0 


0 






Pr'T/C NOR/1 (inRR 


DCX8 


23 


RYKHDIGCDAGVDKKSSSVRGGCGAHSSPPRAGRGPRGTMVSRL 


DCXll 


24 


SQGSKQCMQYRTGRLTVGSEYGCGMNPARHATPAYPARLLPRYR 


DCX26 


25 


SGRTTSE I SGLWGWGDDRSG YGWGNTLRPNY I PYRQATNRHRYT 


DCX33 


26 


RWNWTVLPATGGHYWTRSTDYHAINNHRPS I PHQH PTP I 


5 DCX3 6 


27 


SWSSWNWSSKTTRLGDRATREGCGPSQSDGCPYNGRLTTVKPRT 


DCX3 9 


28 


SGSLNAWQPRSWVGGAFRSHANNNLNPKPTMVTRHPT 


DCX42 


29 


RYSGLSPRDNGPACSQEATLEGCGAQRLMSTRRKGRNSRPGWTL 


DCX45 


30 


SVGNDKTSRPVSFYGRVSDLWNASLMPKRTPSSKRHDDG 



10 hPEPTl 





PAX9 


31 


RWPS VGYKGNGSDT I DVHSNDASTKRSL I YNHRRPLF P 




PAX14 


32 


RTFENDGLGVGRSIQKKSDRWYASHNIRSHFASMSPAGK 




PAX15 


33 


SYCRVKGGGEGGHTDSNLARSGCGKVARTSRLQHINPRATPPSR 




PAX16 


34 


SWTRWGKHTHGGFVNKSPPGKNATSPYTDAQLPSDQGPP 


15 


PAX17 


35 


SQVDSFRNSFRWYEPSRALCHGCGKRDTSTTRIHNSPSDSYPTR 




PAX18 


36 


SFLRFQSPRFEDYSRTISRLRNATNPSNVSDAHNNRALA 




PAX35 


37 


RSITDGGINEVDLSSVSIWLENANSHRAYRKHRPTLKRP 




PAX38 


38 


SSKVSSPRDPTVPRKGGNVDYGCGHRSSARMPTSALSSITKCYT 




PAX40 


39 


RASTQGGRGVAPEFGASVLGRGCGSATYYTNSTSCKDAMGHNYS 


20 


PAX43 


40 


RWCEKHKFTAARCSAGAGFERDASRPPQPAHRDNTNRNA 




PAX45 


41 


SFQVYPDHGLERHALDGTGPLYAMPGRWIRARPQNRDRQ 




PAX46 


42 


SRCTDNEQCPDTGTRSRSVSNARYFSSRLLKTHAPHRP 




P31 


43 


SARDSGPAEDGSRAVRLNGVENANTRKSSRSNPRGRRHP 




P90 


44 


SSADAEKCAGSLLWWGRQNNSGCGSPTKKHLKHRNRSQTSSSSH 


25 


5PAX3 


45 


RPKISIVADAYSSQDGAAAEETSHASNAARKSPKHKPLRRP 




5PAX5 


46 


RGSTGTAGGERSGVLNLHTRDNASGSGFKPWYPSNRGHK 




5PAX7 


47 


RWGWERSPSDYDSDMDLGARRYATRTHRAPPRVLKAPLP 




5PAX12 


48 


RGWKCEGSQAAYGDKDIGRSRGCGSITKNNTNHAHPSHGAVAKI 


30 


HPT-1 








HAX9 


49 


SREEANWDGYKREMSHRSRFWDATHLSRPRRPANSGDPN 




HAX35 


50 


EWYSWKRSSKSTGLGDTATREGCGPSQSDGCPYNGRLTTVKPRK 




HAX40 


51 


REFAERRLWGCDDLSWRLDAEGCGPTPSNRAVKHRKPRPRSPAL 




HAX42 


52 


SDHALGTNLRSDNAKEPGDYNCCGNGNSTGRKVFNRRRPSAIPT 


35 


HCA3 


53 


RHISEYSFANSHLMGGESKRKGCGINGSFSPTCPRSPTPAFRRT 




H40 


54 


SRESGMWGSWWRGHRLNSTGGNANMNASLPPDPPVSTP 




PAX2 


55 


STPPSREAYSRPYSVDSDSDTNAKHSSHNRRLRTRSRPN 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:175: 

Arg Arg Leu Arg Thr Arg Ser Arg Pro Asn 
15 10 

(2) INFORMATION FOR SEQ ID NO: 176: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 708 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176: 



Met 


Gly 


Met 


Ser 


Lys 


Ser 


His 


Ser 


Phe 


Phe 


Gly 


Tyr 


Pro 


Leu 


Ser 


He 


1 








5 










10 










15 




Phe 


Phe 


He 


Val 
20 


Val 


Asn 


Glu 


Phe 


Cys 
25 


Glu 


Arg 


Phe 


Ser 


Tyr 
30 


Tyr 


Gly 


Met 


Arg 


Ala 
35 


He 


Leu 


He 


Leu 


Tyr 
40 


Phe 


Thr 


Asn 


Phe 


He 
45 


Ser 


Trp 


Asp 


Asp 


Asn 
50 


Leu 


Ser 


Thr 


Ala 


He 
55 


Tyr 


His 


Thr 


Phe 


Val 
60 


Ala 


Leu 


Cys 


Tyr 


Leu 


Thr 


Pro 


He 


Leu 


Gly 


Ala 


Leu 


He 


Ala 


Asp 


Ser 


Trp 


Leu 


Gly 


Lys 


65 










70 










75 










80 


Phe 


Lys 


Thr 


He 


Val 
85 


Ser 


Leu 


Ser 


He 


Val 
90 


Tyr 


Thr 


He 


Gly 


Gin 
95 


Ala 


Val 


Thr 


Ser 


Val 
100 


Ser 


Ser 


He 


Asn 


Asp 
105 


Leu 


Thr 


Asp 


His 


Asn 
110 


His 


Asp 


Gly Thr 


Pro 


Asp 


Ser 


Leu 


Pro 


Val 


His 


Val 


Val 


Leu 


Ser 


Leu 


He 


Gly 






115 










120 










125 








Leu 


Ala 
130 


Leu 


He 


Ala 


Leu 


Gly 
135 


Thr 


Gly 


Gly 


He 


Lys 
140 


Pro 


Cys 


Val 


Ser 


Ala 


Phe 


Gly 


Gly 


Asp 


Gin 


Phe 


Glu 


Glu 


Gly Gin 


Glu 


Lys 


Gin Arg 


Asn 


145 










150 










155 










160 


Arg 


Phe 


Phe 


Ser 


He 
165 


Phe 


Tyr 


Leu 


Ala 


He 
170 


Asn 


Ala 


Gly 


Ser 


Leu 
175 


Leu 


Ser 


Thr 


He 


He 
180 


Thr 


Pro 


Met 


Leu 


Arg 
185 


Val 


Gin 


Gin 


Cys 


Gly 
190 


He 


His 


Ser 


Lys 


Gin 


Ala 


Cys 


Tyr 


Pro 


Leu 


Ala 


Phe Gly 


Val 


Pro 


Ala 


Ala 


Leu 






195 










200 










205 








Met 


Ala 


Val 


Ala 


Leu 


He 


Val 


Phe 


Val 


Leu Gly 


Ser 


Gly 


Met 


Tyr 


Lys 




210 










215 










220 










Lys 


Phe 


Lys 


Pro 


Gin 


Gly 


Asn 


He 


Met 


Gly Lys 


Val 


Ala 


Lys 


Cys 


He 


225 










230 










235 










240 


Gly Phe 


Ala 


He 


Lys 


Asn 


Arg 


Phe 


Arg 


His Arg 


Ser 


Lys 


Ala 


Phe 


Pro 










245 










250 










255 




Lys Arg 


Glu 


His 


Trp 


Leu 


Asp 


Trp 


Ala 


Lys 


Glu 


Lys 


Tyr 


Asp 


Glu 


Arg 








260 










265 










270 






Leu 


He 


Ser 


Gin 


He 


Lys 


Met 


Val 


Thr 


Arg Val 


Met 


Phe 


Leu 


Tyr 


He 






275 










280 










285 








Pro 


Leu 


Pro 


Met 


Phe 


Trp 


Ala 


Leu 


Phe 


Asp Gin 


Gin 


Gly 


Ser 


Arg 


Trp 




290 










295 










300 










Thr 


Leu 


Gin 


Ala 


Thr 


Thr 


Met 


Ser 


Gly 


Lys 


He 


Gly 


Ala 


Leu 


Glu 


He 


305 










310 






315 








320 


Gin 


Pro 


Asp 


Gin 


Met 


Gin 


Thr 


Val 


Asn 


Ala 


He 


Leu 


He 


Val 


He 


Met 
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325 330 335 

Val Pro He Phe Asp Ala Val Leu Tyr Pro Leu He Ala Lys Cys Gly 

340 345 350 

Phe Asn Phe Thr Ser Leu Lys Lys Met Ala Val Gly Met Val Leu Ala 

355 360 365 

Ser Met Ala Phe Val Val Ala Ala He Val Gin Val Glu He Asp Lys 

370 375 380 

Thr Leu Pro Val Phe Pro Lys Gly Asn Glu Val Gin He Lys Val Leu 
385 390 395 400 

Asn He Gly Asn Asn Thr Met Asn He Ser Leu Pro Gly Glu Met Val 

405 410 415 

Thr Leu Gly Pro Met Ser Gin Thr Asn Ala Phe Met Thr Phe Asp Val 

420 425 430 

Asn Lys Leu Thr Arg He Asn He Ser Ser Pro Gly Ser Pro Val Thr 

435 440 445 

Ala Val Thr Asp Asp Phe Lys Gin Gly Gin Arg His Thr Leu Leu Val 

450 455 460 

Trp Ala Pro Asn His Tyr Gin Val Val Lys Asp Gly Leu Asn Gin Lys 
465 470 475 480 

Pro Glu Lys Gly Glu Asn Gly He Arg Phe Val Asn Thr Phe Asn Glu 

485 490 495 

Leu He Thr He Thr Met Ser Gly Lys Val Tyr Ala Asn He Ser Ser 

500 505 510 

Tyr Asn Ala Ser Thr Tyr Gin Phe Phe Pro Ser Gly He Lys Gly Phe 
515 520 525 

15 Thr He Ser Ser Thr Glu He Pro Pro Gin Cys Gin Pro Asn Phe Asn 
530 535 540 

Thr Phe Tyr Leu Glu Phe Gly Ser Ala Tyr Thr Tyr He Val Gin Arg 
545 550 555 560 

Lys Asn Asp Ser Cys Pro Glu Val Lys Val Phe Glu Asp He Ser Ala 

565 570 575 

Asn Thr Val Asn Met Ala Leu Gin He Pro Gin Tyr Phe Leu Leu Thr 

580 585 590 

Cys Gly Glu Val Val Phe Ser Val Thr Gly Leu Glu Phe Ser Tyr Ser 
20 595 600 605 

Gin Ala Pro Ser Asn Met Lys Ser Val Leu Gin Ala Gly Trp Leu Leu 

610 615 620 

Thr Val Ala Val Gly Asn He He Val Leu He Val Ala Gly Ala Gly 
625 630 635 640 

Gin Phe Ser Lys Gin Trp Ala Glu Tyr He Leu Phe Ala Ala Leu Leu 

645 650 655 

Leu Val Val Cys Val Val Phe Ala He Met Ala Arg Phe Tyr Thr Tyr 
660 665 670 

25 He Asn Pro Ala Glu He Glu Ala Gin Phe Asp Glu Asp Glu Lys Lys 
675 680 685 

Asn Arg Leu Glu Lys Ser Asn Pro Tyr Phe Met Ser Gly Ala Asn Ser 

690 695 700 

Gin Lys Gin Met 
705 

(2) INFORMATION FOR SEQ ID NO: 177: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3345 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 
(ix) FEATURE: 

35 (A) NAME/KEY: Coding Sequence 

(B) LOCATION: 88... 2583 
(D) OTHER INFORMATION: 
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CAC CAG ACT GGG ATA CCC ACT GTG GGC ATG GCA GTT GGT ATA CTG CTG 2466 
His Gin Thr Gly lie Pro Thr Val Gly Met Ala Val Gly lie Leu Leu 
780 785 790 

ACC ACC CTT CTG GTG ATT GGT ATA ATT TTA GCA GTT GTG TTT ATC CGC 2514 
Thr Thr Leu Leu Val He Gly He He Leu Ala Val Val Phe He Arg 
795 800 805 

ATA AAG AAG GAT AAA GGC AAA GAT AAT GTT GAA AGT GCT CAA GCA TCT 2562 
He Lys Lys Asp Lys Gly Lys Asp Asn Val Glu Ser Ala Gin Ala Ser 
810 815 820 825 

GAA GTC AAA CCT CTG AGA AGC TGAATTTGAA AAGGAATGTT TGAATTTATA TAGC 2617 
Glu Val Lys Pro Leu Arg Ser 
830 

AAGTGCTATT TCAGCAACAA CCATCTCATC CTATTACTTT TCATCTAACG TGCATTATAA 2677 

TTTTTTAAAC AGATATTCCC TCTTGTCCTT TAATATTTGC TAAATATTTC TTTTTTGAGG 2737 

TGGAGTCTTG CTCTGTCGCC CAGGCTGGAG TACAGTGGTG TGATCCCAGC TCACTGCAAC 2 797 

CTCCGCCTCC TGGGTTCACA TGATTCTCCT GCCTCAGCTT CCTAAGTAGC TGGGTTTACA 2857 

GGCACCCACC ACCATGCCCA GCTAATTTTT GTATTTTTAA TAGAGACGGG GTTTCGCCAT 2917 

TTGGCCAGGC TGGTCTTGAA CTCCTGACGT CAAGTGATCT GCCTGCCTTG GTCTCCCAAT 2977 

ACAGGCATGA ACCACTGCAC CCACCTACTT AGATATTTCA TGTGCTATAG ACATTAGAGA 3037 

GATTTTTCAT TTTTCCATGA CATTTTTCCT CTCTGCAAAT GGCTTAGCTA CTTGTGTTTT 3097 

TCCCTTTTGG GGCAAGACAG ACTCATTAAA TATTCTGTAC ATTTTTTCTT TATCAAGGAG 3157 

ATATATCAGT GTTGTCTCAT AGAACTGCCT GGATTCCATT TATGTTTTTT CTGATTCCAT 3217 

CCTGTGTCCC CTTCATCCTT GACTCCTTTG GTATTTCACT GAATTTCAAA CATTTGTCAG 3277 

AGAAGAAAAA AGTGAGGACT CAGGAAAAAT AAATAAATAA AAGAACAGCC TTTTGCGGCC 3337 

GCGAATTC 3345 

(2) INFORMATION FOR SEQ ID NO: 178: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 832 amino acids 

20 (B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:178: 

Met II 

25 



15 



30 



35 



Met 


He 


Leu 


Gin 


Ala 


His 


Leu 


His 


Ser 


Leu 


Cys 


Leu 


Leu Met Leu Tyr 


1 








5 










10 






15 


Leu 


Ala 


Thr 


Gly 


Tyr 


Gly 


Gin 


Glu 


Gly 


Lys 


Phe 


Ser 


Gly Pro Leu Lys 








20 










25 








30 


Pro 


Met 


Thr 


Phe 


Ser 


He 


Tyr 


Glu 


Gly 


Gin 


Glu 


Pro 


Ser Gin He He 






35 










40 










45 


Phe 


Gin 


Phe 


Lys 


Ala 


Asn 


Pro 


Pro 


Ala 


Val 


Thr 


Phe 


Glu Leu Thr Gly 




50 










55 










60 




Glu 


Thr 


Asp 


Asn 


He 


Phe 


Val 


He 


Glu 


Arg 


Glu 


Gly Leu Leu Tyr Tyr 


65 










70 










75 




80 


Asn 


Arg 


Ala 


Leu 


Asp 


Arg 


Glu 


Thr 


Arg 


Ser 


Thr 


His 


Asn Leu Gin Val 








85 










90 






95 


Ala 


Ala 


Leu 


Asp 


Ala 


Asn 


Gly 


He 


He 


Val 


Glu 


Gly 


Pro Val Pro He 








100 










105 








110 


Thr 


He 


Glu 


Val 


Lys 


Asp 


He 


Asn 


Asp 


Asn 


Arg 


Pro 


Thr Phe Leu Gin 






115 










120 










125 


Ser 


Lys 


Tyr 


Glu 


Gly 


Ser 


Val 


Arg 


Gin 


Asn 


Ser 


Arg 


Pro Gly Lys Pro 




130 










135 










140 




Phe 


Leu 


Tyr 


Val 


Asn 


Ala 


Thr 


Asp 


Leu 


Asp 


Asp 


Pro 


Ala Thr Pro Asn 


145 










150 










155 




160 


Gly 


Gin 


Leu 


Tyr 


Tyr 


Gin 


He 


Val 


He 


Gin 


Leu 


Pro 


Met He Asn Asn 










165 










170 






175 


Val 


Met 


Tyr 


Phe 


Gin 


He 


Asn 


Asn 


Lys 


Thr 


Gly 


Ala 


He Ser Leu Thr 






180 










185 








190 
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Arq Glu Gly 


Ser 


Gin 


Glu Leu 


Asn 


Pro 


Ala 


Lys Asn 


Pro 


Ser 


Tyr 


Asn 




195 








200 








205 








Leu Val 


He 


Ser 


Val 


Lys Asp 


Met Gly Gly Gin Ser 


Glu 


Asn 


Ser 


Phe 


210 








215 








220 










Ser Asp 


Thr 


Thr 


Ser 


Val Asp 


He 


He 


Val 


Thr Glu 


Asn 


He 


Trp 


Lys 


225 








230 








235 








240 


Ala Pro 


Lys 


Pro 


Val 


Glu Met 


Val 


Glu 


Asn 


Ser Thr Asp 


Pro 


His 


Pro 








245 








250 








255 




lie Lys 


He 


Thr 


Gin 


Val Arg 


Trp 


Asn 


Asp 


Pro Gly Ala 


Gin 


Tyr 


Ser 




260 








265 








270 






Leu Val 


Asp 


Lys 


Glu 


Lys Leu 


Pro Arg 


Phe 


Pro Phe 


Ser 


He 


Asp 


Gin 




275 








280 








285 








Glu Gly Asp 


He 


Tyr 


Val Thr 


Gin 


Pro 


Leu 


Asp Arg 


Glu 


Glu 


Lys 


Asp 


290 








295 








300 










Ala Tyr 


Val 


Phe 


Tyr 


Ala Val 


Ala 


Lys 


Asp 


Glu Tyr 


Gly Lys 


Pro 


Leu 


305 








310 








315 








320 


Ser Tyr 


Pro 


Leu 


Glu 


He His 


Val 


Lys 


Val 


Lys Asp 


He 


Asn 


Asp 


Asn 








325 








330 








335 




Pro Pro 


Thr 


Cys 


Pro 


Ser Pro 


Val 


Thr 


Val 


Phe Glu 


Val 


Gin 


Glu 


Asn 






340 








345 








350 






Glu Arg 


Leu Gly Asn 


Ser He 


Gly Thr 


Leu 


Thr Ala 


His 


Asp 


Arg 


Asp 




355 








360 








365 








Glu Glu 


Asn 


Thr 


Ala 


Asn Ser 


Phe 


Leu 


Asn 


Tvr ArQ 


He 


Val 


Glu 


Gin 


370 








375 








380 










Thr Pro 


Lys 


Leu 


Pro 


Met Asp 


Gly Leu 


Phe 


Leu lie 


Gin 


Thr 


Tyr 


Ala 


385 








390 








395 








400 


Gly Met 


Leu 


Gin 


Leu 


Ala Lys 


Gin 


Ser 


Leu 


Lys Lys 


Gin 


Asp 


Thr 


Pro 








405 








410 








415 




Gin Tyr 


Asn 


Leu 


Thr 


He Glu 


Val 


Ser 


Asp 


Lys Asp 


Phe 


Lys 


Thr 


Leu 






420 








425 








430 






Cys Phe 


Val 


Gin 


He 


Asn Val 


He 


Asp 


He 


Asn Asp 


Gin 


He 


Pro 


He 


435 








440 








445 








Phe Glu 


Lys 


Ser Asp 


Tyr Gly Asn 


Leu 


Thr 


Leu Ala 


Glu 


Asp 


Thr 


Asn 


450 








455 








460 










lie Gly Ser Thr 


He 


Leu Thr 


He 


Gin 


Ala 


Thr Asp 


Ala 


Asp 


Glu 


Pro 


465 








470 








475 








480 


Phe Thr Gly 


Ser 


Ser 


Lys He 


Leu 


Tyr 


His 


He He 


Lys 


Gly 


Asp 


Ser 








485 








490 








495 




Glu Gly Arg 


Leu 


Gly Val Asp 


Thr 


Asp 


Pro 


His Thr 


Asn 


Thr 


Gly 


Tyr 






500 








505 








510 






Val lie 


He 


Lys 


Lys 


Pro Leu 


Asp 


Phe 


Glu 


Thr Ala 


Ala 


Val 


Ser 


Asn 




515 








520 








525 








He Val 


Phe 


Lys 


Ala 


Glu Asn 


Pro 


Glu 


Pro 


Leu Val 


Phe Gly 


Val 


Lys 


530 








535 








540 










Tyr Asn Ala 


Ser 


Ser 


Phe Ala 


Lys 


Phe 


Thr 


Leu He 


Val 


Thr 


Asp 


Val 


545 








550 








555 






560 


Asn Glu 


Ala 


Pro 


Gin 


Phe Ser 


Gin 


His 


Val 


Phe Gin 


Ala 


Lys 


Val 


Ser 








565 








570 






575 




Glu Asp Val 


Ala 


He 


Gly Thr 


Lys 


Val 


Gly Asn Val 


Thr 


Ala 


Lys 


Asp 






580 








585 








590 






Pro Glu Gly Leu Asp 


He Ser 


Tyr 


Ser Leu Arg Gly Asp Thr 


Arg 


Gly 




595 








600 








605 








Trp Leu 


Lys 


He 


Asp 


His Val 


Thr Gly Glu 


He Phe 


Ser 


Val 


Ala 


Pro 


610 








615 








620 










Leu Asp Arg 


Glu 


Ala 


Gly Ser 


Pro 


Tyr Arg Val Gin Val Val 


Ala 


Thr 


625 








630 








635 








640 


Glu Val 


Gly Gly Ser 


Ser Leu 


Ser 


Ser 


Val 


Ser Glu 


Phe 


His 


Leu 


He 








645 








650 








655 




Leu Met Asp Val 


Asn 


Asp Asn 


Pro 


Pro Arg 


Leu Ala 


Lys Asp 


Tyr 


Thr 






660 








665 








670 






Gly Leu Phe 


Phe 


Cys 


His Pro 


Leu 


Ser 


Ala 


Pro Gly Ser 


Leu 


He 


Phe 




675 








680 








685 








Glu Ala Thr Asp Asp Asp Gin His 


Leu 


Phe Arg Gly Pro His 


Phe 


Thr 


690 








695 








700 










Phe Ser Leu Gly Ser Gly Ser Leu Gin Asn Asp Trp Glu Val 


Ser 


Lys 


705 








710 








715 








720 
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15 



20 



lie 


Asn Gly 


Thr 


His 


Ala 


Arg 


Leu 


Ser 


Thr 


Arg 


His Thr Asp Phe Glu 






725 










730 




735 


Glu 


Arg Ala 


Tyr 


Val 


Val 


Leu 


He 


Arg 


He 


Asn 


Asp Gly Gly Arg Pro 




740 










745 






750 


Pro 


Leu Glu 


Gly 


He 


Val 


Ser 


Leu 


Pro 


Val 


Thr 


Phe Cys Ser Cys Val 




755 








760 








765 


Glu Gly Ser 


Cys 


Phe 


Arg 


Pro 


Ala 


Gly 


His 


Gin 


Thr Gly He Pro Thr 




770 








775 










780 


Val 


Gly Met 


Ala 


Val 


Gly 


He 


Leu 


Leu 


Thr 


Thr 


Leu Leu Val He Gly 


785 






790 










795 


800 


lie 


lie Leu 


Ala 


Val 


Val 


Phe 


He 


Arg 


He 


Lys 


Lys Asp Lys Gly Lys 








805 










810 




815 


Asp 


Asn Val 


Glu 


Ser 


Ala 


Gin 


Ala 


Ser 


Glu 


Val 


Lys Pro Leu Arg Ser 




820 










825 






830 



(2) INFORMATION FOR SEQ ID NO: 17 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1827 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:179: 



Met Ala Arg Lys Lys Phe Ser Gly Leu Glu He Ser Leu He Val Leu 

15 10 15 

Phe Val He Val Thr He He Ala He Ala Leu He Val Val Leu Ala 

20 25 30 

Thr Lys Thr Pro Ala Val Asp Glu He Ser Asp Ser Thr Ser Thr Pro 

35 40 45 

Ala Thr Thr Arg Val Thr Thr Asn Pro Ser Asp Ser Gly Lys Cys Pro 

50 55 60 

Asn Val Leu Asn Asp Pro Val Asn Val Arg He Asn Cys He Pro Glu 
65 70 75 80 

Gin Phe Pro Thr Glu Gly He Cys Ala Gin Arg Gly Cys Cys Trp Arg 

85 90 95 

Pro Trp Asn Asp Ser Leu He Pro Trp Cys Phe Phe Val Asp Asn His 

100 105 110 

Gly Tyr Asn Val Gin Asp Met Thr Thr Thr Ser He Gly Val Glu Ala 
115 120 125 

25 ^9 "^^^ ■^■^^ 

130 135 140 

Ser Val Leu Phe Thr Thr Gin Asn Gin Thr Pro Asn Arg Phe Arg Phe 
145 150 155 160 

Lys He Thr Asp Pro Asn Asn Arg Arg Tyr Glu Val Pro His Gin Tyr 

165 170 175 

Val Lys Glu Phe Thr Gly Pro Thr Val Ser Asp Thr Leu Tyr Asp Val 

180 185 190 

Lys Val Ala Gin Asn Pro Phe Ser He Gin Val He Arg Lys Ser Asn 
30 195 200 205 

Gly Lys Thr Leu Phe Asp Thr Ser He Gly Pro Leu Val Tyr Ser Asp 

210 215 220 

Gin Tyr Leu Gin He Ser Ala Arg Leu Pro Ser Asp Tyr He Tyr Gly 
225 230 235 240 

He Gly Glu Gin Val His Lys Arg Phe Arg His Asp Leu Ser Trp Lys 

245 250 255 

Thr Trp Pro He Phe Thr Arg Asp Gin Leu Pro Gly Asp Asn Asn Asn 
260 265 270 

35 Asn Leu Tyr Gly His Gin Thr Phe Phe Met Cys He Glu Asp Thr Ser 
275 280 285 

Gly Lys Ser Phe Gly Val Phe Leu Met Asn Ser Asn Ala Met Glu He 

290 295 300 

Phe He Gin Pro Thr Pro He Val Thr Tyr Arg Val Thr Gly Gly He 
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305 310 315 320 

Leu Asp Phe Tyr He Leu Leu Gly Asp Thr Pro Glu Gin Val Val Gin 

325 330 335 

Gin Tyr Gin Gin Leu Val Gly Leu Pro Ala Met Pro Ala Tyr Trp Asn 

340 345 350 

Leu Gly Phe Gin Leu Ser Arg Trp Asn Tyr Lys Ser Leu Asp Val Val 

355 360 365 

Lys Glu Val Val Arg Arg Asn Arg Glu Ala Gly He Pro Phe Asp Thr 

370 375 380 

Gin Val Thr Asp He Asp Tyr Met Glu Asp Lys Lys Asp Phe Thr Tyr 
385 390 395 400 

Asp Gin Val Ala Phe Asn Gly Leu Pro Gin Phe Val Gin Asp Leu His 

405 410 415 

Asp His Gly Gin Lys Tyr Val He He Leu Asp Pro Ala He Ser He 

420 425 430 

Gly Arg Arg Ala Asn Gly Thr Thr Tyr Ala Thr Tyr Glu Arg Gly Asn 

435 440 445 

Thr Gin His Val Trp He Asn Glu Ser Asp Gly Ser Thr Pro He He 

450 455 460 

Gly Glu Val Trp Pro Gly Leu Thr Val Tyr Pro Asp Phe Thr Asn Pro 
465 470 475 480 

Asn Cys He Asp Trp Trp Ala Asn Glu Cys Ser He Phe His Gin Glu 

485 490 495 

Val Gin Tyr Asp Gly Leu Trp He Asp Met Asn Glu Val Ser Ser Phe 
500 505 510 

n 5 He Gin Gly Ser Thr Lys Gly Cys Asn Val Asn Lys Leu Asn Tyr Pro 
515 520 525 

Pro Phe Thr Pro Asp He Leu Asp Lys Leu Met Tyr Ser Lys Thr He 

530 535 540 

Cys Met Asp Ala Val Gin Asn Trp Gly Lys Gin Tyr Asp Val His Ser 
545 550 555 560 

Leu Tyr Gly Tyr Ser Met Ala He Ala Thr Glu Gin Ala Val Gin Lys 

565 570 575 

Val Phe Pro Asn Lys Arg Ser Phe He Leu Thr Arg Ser Thr Phe Ala 
20 580 585 590 

Gly Ser Gly Arg His Ala Ala His Trp Leu Gly Asp Asn Thr Ala Ser 

595 600 605 

Trp Glu Gin Met Glu Trp Ser He Thr Gly Met Leu Glu Phe Ser Leu 

610 615 620 

Phe Gly He Pro Leu Val Gly Ala Asp He Cys Gly Phe Val Ala Glu 
625 630 635 640 

Thr Thr Glu Glu Leu Cys Arg Arg Trp Met Gin Leu Gly Ala Phe Tyr 
645 650 655 

25 Pro Phe Ser Arg Asn His Asn Ser Asp Gly Tyr Glu His Gin Asp Pro 
660 665 670 

Ala Phe Phe Gly Gin Asn Ser Leu Leu Val Lys Ser Ser Arg Gin Tyr 

675 680 685 

Leu Thr He Arg Tyr Thr Leu Leu Pro Phe Leu Tyr Thr Leu Phe Tyr 

690 695 700 

Lys Ala His Val Phe Gly Glu Thr Val Ala Arg Pro Val Leu His Glu 
705 710 715 720 

Phe Tyr Glu Asp Thr Asn Ser Trp He Glu Asp Thr Glu Phe Leu Trp 
30 725 730 735 

Gly Pro Ala Leu Leu He Thr Pro Val Leu Lys Gin Gly Ala Asp Thr 

740 745 750 

Val Ser Ala Tyr He Pro Asp Ala He Trp Tyr Tisp Tyr Glu Ser Gly 

755 760 765 

Ala Lys Arg Pro Trp Arg Lys Gin Arg Val Asp Met Tyr Leu Pro Ala 

770 775 780 

Asp Lys He Gly Leu His Leu Arg Gly Gly Tyr He He Pro He Gin 
785 790 795 800 

35 Glu Pro Asp Val Thr Thr Thr Ala Ser Arg Lys Asn Pro Leu Gly Leu 

805 810 815 

He Val Ala Leu Gly Glu Asn Asn Thr Ala Lys Gly Asp Phe Phe Trp 

820 825 830 

Asp Asp Gly Glu Thr Lys Asp Thr He Gin Asn Gly Asn Tyr He Leu 



- 187 - 



wo 98/51325 PCTAJS98/1 0088 



10 



835 840 845 

Tyr Thr Phe Ser Val Ser Asn Asn Thr Leu Asp lie Val Cys Thr His 

850 855 860 

Ser Ser Tyr Gin Glu Gly Thr Thr Leu Ala Phe Gin Thr Val Lys lie 
865 870 875 880 

Leu Gly Leu Thr Asp Ser Val Thr Glu Val Arg Val Ala Glu Asn Asn 

885 890 895 

Gin Pro Met Asn Ala His Ser Asn Phe Thr Tyr Asp Ala Ser Asn Gin 

900 905 910 

Val Leu Leu lie Ala Asp Leu Lys Leu Asn Leu Gly Arg Asn Phe Ser 

915 920 925 

Val Gin Trp Asn Gin lie Phe Ser Glu Asn Glu Arg Phe Asn Cys Tyr 

930 935 940 

Pro Asp Ala Asp Leu Ala Thr Glu Gin Lys Cys Thr Gin Arg Gly Cys 
945 950 955 960 

Val Trp Arg Thr Gly Ser Ser Leu Ser Lys Ala Pro Glu Cys Tyr Phe 

965 970 975 

Pro Arg Gin Asp Asn Ser Tyr Ser Val Asn Ser Ala Arg Tyr Ser Ser 

980 985 990 

Met Gly He Thr Ala Asp Leu Gin Leu Asn Thr Ala Asn Ala Arg He 

995 1000 1005 

Lys Leu Pro Ser Asp Pro He Ser Thr Leu Arg Val Glu Val Lys Tyr 

1010 1015 1020 

His Lys Asn Asp Met Leu Gin Phe Lys lie Tyr Asp Pro Gin Lys Lys 
025 1030 1035 1040 

1 5 Arg Tyr Glu Val Pro Val Pro Leu Asn He Pro Thr Thr Pro He Ser 
1045 1050 1055 

Thr Tyr Glu Asp Arg Leu Tyr Asp Val Glu He Lys Glu Asn Pro Phe 

1060 1065 1070 

Gly He Gin He Arg Arg Arg Ser Ser Gly Arg Val He Trp Asp Ser 

1075 1080 1085 

Trp Leu Pro Gly Phe Ala Phe Asn Asp Gin Phe He Gin He Ser Thr 

1090 1095 1100 

Arg Leu Pro Ser Glu Tyr He Tyr Gly Phe Gly Glu Val Glu His Thr 
20 105 1110 1115 1120 

Ala Phe Lys Arg Asp Leu Asn Trp Asn Thr Trp Gly Met Phe Thr Arg 

1125 1130 1135 

Asp Gin Pro Pro Gly Tyr Lys Leu Asn Ser Tyr Gly Phe His Pro Tyr 

1140 1145 1150 

Tyr Met Ala Leu Glu Glu Glu Gly Asn Ala His Gly Val Phe Leu Leu 

1155 1160 1165 

Asn Ser Asn Ala Met Asp Val Thr Phe Gin Pro Thr Pro Ala Leu Thr 
1170 1175 1180 

25 "^^^ -^^^ 

185 1190 1195 1200 

Thr Pro Gin Val Ala Thr Lys Gin Tyr His Glu Val He Gly His Pro 

1205 1210 1215 

Val Met Pro Ala Tyr Trp Ala Leu Gly Phe Gin Leu Cys Arg Tyr Gly 

1220 1225 1230 

Tyr Ala Asn Thr Ser Glu Val Arg Glu Leu Tyr Asp Ala Met Val Ala 

1235 1240 1245 

Ala Asn He Pro Tyr Asp Val Gin Tyr Thr Asp He Asp Tyr Met Glu 
30 1250 1255 1260 

Arg Gin Leu Asp Phe Thr He Gly Glu Ala Phe Gin Asp Leu Pro Gin 
265 1270 1275 1280 

Phe Val Asp Lys He Arg Gly Glu Gly Met Arg Tyr He He He Leu 

1285 1290 1295 

Asp Pro Ala He Ser Gly Asn Glu Thr Lys Thr Tyr Pro Ala Phe Glu 

1300 1305 1310 

Arg Gly Gin Gin Asn Asp Val Phe Val Lys Trp Pro Asn Thr Asn Asp 
1315 1320 1325 

35 He Cys Trp Ala Lys Val Trp Pro Asp Leu Pro Asn He Thr He Asp 
1330 1335 1340 

Lys Thr Leu Thr Glu Asp Glu Ala Val Asn Ala Ser Arg Ala His Val 
345 1350 1355 1360 

Ala Phe Pro Asp Phe Phe Arg Thr Ser Thr Ala Glu Trp Trp Ala Arg 
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1365 1370 1375 

Glu lie Val Asp Phe Tyr Asn Glu Lys Met Lys Phe Asp Gly Leu Trp 

1380 1385 1390 

He Asp Met Asn Glu Pro Ser Ser Phe Val Asn Gly Thr Thr Thr Asn 

1395 1400 1405 

Gin Cys Arg Asn Asp Glu Leu Asn Tyr Pro Pro Tyr Phe Pro Glu Leu 

1410 1415 1420 

Thr Lys Arg Thr Asp Gly Leu His Phe Arg Thr He Cys Met Glu Ala 
425 1430 1435 1440 

Glu Gin He Leu Ser Asp Gly Thr Ser Val Leu His Tyr Asp Val His 

1445 1450 1455 

Asn Leu Tyr Gly Trp Ser Gin Met Lys Pro Thr His Asp Ala Leu Gin 

1460 1465 1470 

Lys Thr Thr Gly Lys Arg Gly He Val He Ser Arg Ser Thr Tyr Pro 

1475 1480 1485 

Thr Ser Gly Arg Trp Gly Gly His Trp Leu Gly Asp Asn Tyr Ala Arg 

1490 1495 1500 

Trp Asp Asn Met Asp Lys Ser He He Gly Met Met Glu Phe Ser Leu 
505 1510 1515 1520 

Phe Gly He Ser Tyr Thr Gly Ala Asp He Cys Gly Phe Phe Asn Asn 

1525 1530 1535 

Ser Glu Tyr His Leu Cys Thr Arg Trp Met Gin Leu Gly Ala Phe Tyr 

1540 1545 1550 

Pro Tyr Ser Arg Asn His Asn He Ala Asn Thr Arg Arg Gin Asp Pro 
1555 1560 1565 

15 Ala Ser Trp Asn Glu Thr Phe Ala Glu Met Ser Arg Asn He Leu Asn 
1570 1575 1580 

He Arg Tyr Thr Leu Leu Pro Tyr Phe Tyr Thr Gin Met His Glu He 
585 1590 1595 1600 

His Ala Asn Gly Gly Thr Val He Arg Pro Leu Leu His Glu Phe Phe 

1605 1610 1615 

Asp Glu Lys Pro Thr Trp Asp He Phe Lys Gin Phe Leu Trp Gly Pro 

1620 1625 1630 

Ala Phe Met Val Thr Pro Val Leu Glu Pro Tyr Val Gin Thr Val Asn 

1635 1640 1645 

Ala Tyr Val Pro Asn Ala Arg Trp Phe Asp Tyr His Thr Gly Lys Asp 

1650 1655 1660 

He Gly Val Arg Gly Gin Phe Gin Thr Phe Asn Ala Ser Tyr Asp Thr 
665 1670 1675 1680 

He Asn Leu His Val Arg Gly Gly His He Leu Pro Cys Gin Glu Pro 

1685 1690 1695 

Ala Gin Asn Thr Phe Tyr Ser Arg Gin Lys His Met Lys Leu He Val 
1700 1705 1710 

25 Ala Ala Asp Asp Asn Gin Met Ala Gin Gly Ser Leu Phe Trp Asp Asp 
1715 1720 1725 

Gly Glu Ser He Asp Thr Tyr Glu Arg Asp Leu Tyr Leu Ser Val Gin 

1730 1735 1740 

Phe Asn Leu Asn Gin Thr Thr Leu Thr Ser Thr He Leu Lys Arg Gly 
745 1750 1755 1760 

Tyr He Asn Lys Ser Glu Thr Arg Leu Gly Ser Leu His Val Trp Gly 

1765 1770 1775 

Lys Gly Thr Thr Pro Val Asn Ala Val Thr Leu Thr Tyr Asn Gly Asn 
30 1780 1785 1790 

Lys Asn Ser Leu Pro Phe Asn Glu Asp Thr Thr Asn Met He Leu Arg 

1795 1800 1805 

He Asp Leu Thr Thr His Asn Val Thr Leu Glu Glu Pro He Glu He 

1810 1815 1820 

Asn Trp Ser 
825 



20 



35 



(2) INFORMATION FOR SEQ ID NO: 180: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
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CTT TTC AC A CTC CCT GGA ACT CCT ATA ACT TAC TAT GGA GAA GAA ATT 1496 
Leu Phe Thr Leu Pro Gly Thr Pro lie Thr Tyr Tyr Gly Glu Glu lie 
470 475 480 

GGA ATG GGA AAT ATT GTA GCC GCA AAT CTC AAT GAA AGC TAT GAT ATT 1544 
Gly Met Gly Asn lie Val Ala Ala Asn Leu Asn Glu Ser Tyr Asp lie 
485 490 495 500 

AAT ACC CTT CGC TCA AAG TCA CCA ATG CAG TGG GAC AAT AGT TCA AAT 1592 
Asn Thr Leu Arg Ser Lys Ser Pro Met Gin Trp Asp Asn Ser Ser Asn 
505 510 515 

GCT GGT TTT TCT GAA GCT AGT AAC ACC TGG TTA CCT ACC AAT TCA GAT 164 0 
Ala Gly Phe Ser Glu Ala Ser Asn Thr Trp Leu Pro Thr Asn Ser Asp 
520 525 530 

TAC CAC ACT GTG AAT GTT GAT GTC CAA AAG ACT CAG CCC AG A TCG GCT 1688 
Tyr His Thr Val Asn Val Asp Val Gin Lys Thr Gin Pro Arg Ser Ala 
535 540 545 

TTG AAG TTA TAT CAA GAT TTA AGT CTA CTT CAT GCC AAT GAG CTA CTC 1736 
Leu Lys Leu Tyr Gin Asp Leu Ser Leu Leu His Ala Asn Glu Leu Leu 
550 555 560 

CTC AAC AGG GGC TGG TTT TGC CAT TTG AGG AAT GAC AGC CAC TAT GTT 1784 
3^5 Leu Asn Arg Gly Trp Phe Cys His Leu Arg Asn Asp Ser His Tyr Val 
565 570 575 580 

GTG TAC ACA AGA GAG CTG GAT GGC ATC GAC AGA ATC TTT ATC GTG GTT 1832 
Val Tyr Thr Arg Glu Leu Asp Gly lie Asp Arg He Phe He Val Val 
585 590 595 

CTG AAT TTT GGA GAA TCA ACA CTG TTA AAT CTA CAT AAT ATG ATT TCG 1880 
Leu Asn Phe Gly Glu Ser Thr Leu Leu Asn Leu His Asn Met He Ser 
20 600 605 610 

GGC CTT CCC GCT AAA ATA AGA ATA AGG TTA AGT ACC AAT TCT GCC GAC 1928 
Gly Leu Pro Ala Lys He Arg He Arg Leu Ser Thr Asn Ser Ala Asp 
615 620 625 

AAA GGC AGT AAA GTT GAT ACA AGT GGC ATT TTT CTG GAC AAG GGA GAG 1976 
Lys Gly Ser Lys Val Asp Thr Ser Gly He Phe Leu Asp Lys Gly Glu 
630 635 640 

25 

GGA CTC ATC TTT GAA CAC AAC ACG AAG AAT CTC CTT CAT CGC CAA ACA 2 024 
Gly Leu He Phe Glu His Asn Thr Lys Asn Leu Leu His Arg Gin Thr 
645 650 655 660 

GCT TTC AGA GAT AGA TGC TTT GTT TCC AAT CGA GCA TGC TAT TCC AGT 2072 
Ala Phe Arg Asp Arg Cys Phe Val Ser Asn Arg Ala Cys Tyr Ser Ser 
665 670 675 

30 GTA CTG AAC ATA CTG TAT ACC TCG TGT TAGGCACCTT TATGAAGAGA TGAAGAC 2126 
Val Leu Asn He Leu Tyr Thr Ser Cys 
680 685 

ACTGGCATTT CAGTGGGATT GTAAGCATTT GTAATAGCTT CATGTACAGC ATGCTGCTTG 2186 
GTGAACAATC ATTAATTCTT CGATATTTCT GTAGCTTGAA TGTAACCGCT TTAAGAAAGG 2246 
TTCTCAAATG TTTTGAAAAA AATAAAATGT TTAAAAGT 2284 



35 



(2) INFORMATION FOR SEQ ID NO: 181: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 685 amino acids 

(B) TYPE: amino acid 

( C) STRANDEDNESS : 
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(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181: 

Met Ala Glu Asp Lys Ser Lys Arg Asp Ser lie Glu Met Ser Met Lys 
c 1 5 10 15 

Gly Cys Gin Thr Asn Asn Gly Phe Val His Asn Glu Asp lie Leu Glu 

20 25 30 

Gin Thr Pro Asp Pro Gly Ser Ser Thr Asp Asn Leu Lys His Ser Thr 

35 40 45 

Arg Gly lie Leu Gly Ser Gin Glu Pro Asp Phe Lys Gly Val Gin Pro 

50 55 60 

Tyr Ala Gly Met Pro Lys Glu Val Leu Phe Gin Phe Ser Gly Gin Ala 
65 70 75 80 

10 Arg Tyr Arg lie Pro Arg Glu lie Leu Phe Trp Leu Thr Val Ala Ser 

85 90 95 

Val Leu Val Leu lie Ala Ala Thr lie Ala He He Ala Leu Ser Pro 

100 105 110 

Lys Cys Leu Asp Trp Trp Gin Glu Gly Pro Met Tyr Gin He Tyr Pro 

115 120 125 

Arg Ser Phe Lys Asp Ser Asn Lys Asp Gly Asn Gly Asp Leu Lys Gly 

130 135 140 

He Gin Asp Lys Leu Asp Tyr He Thr Ala Leu Asn He Lys Thr Val 
145 150 155 160 

Trp He Thr Ser Phe Tyr Lys Ser Ser Leu Lys Asp Phe Arg Tyr Gly 

165 170 175 

Val Glu Asp Phe Arg Glu Val Asp Pro He Phe Gly Thr Met Glu Asp 

180 185 190 

Phe Glu Asn Leu Val Ala Ala He His Asp Lys Gly Leu Lys Leu He 

195 200 205 

He Asp Phe He Pro Asn His Thr Ser Asp Lys His He Trp Phe Gin 
210 215 220 

20 l^eiJ Ser Arg Thr Arg Thr Gly Lys Tyr Thr Asp Tyr Tyr He Trp His 
225 230 235 240 

Asp Cys Thr His Glu Asn Gly Lys Thr He Pro Pro Asn Asn Trp Leu 

245 250 255 

Ser Val Tyr Gly Asn Ser Ser Trp His Phe Asp Glu Val Arg Asn Gin 

260 265 270 

Cys Tyr Phe His Gin Phe Met Lys Glu Gin Pro Asp Leu Asn Phe Arg 

275 280 285 

Asn Pro Asp Val Gin Glu Glu He Lys Glu He Leu Arg Phe Trp Leu 
25 290 295 300 

Thr Lys Gly Val Asp Gly Phe Ser Leu Asp Ala Val Lys Phe Leu Leu 
305 310 315 320 

Glu Ala Lys His Leu Arg Asp Glu He Gin Val Asn Lys Thr Gin He 

325 330 335 

Pro Asp Thr Val Thr Gin Tyr Ser Glu Leu Tyr His Asp Phe Thr Thr 

340 345 350 

Thr Gin Val Gly Met His Asp He Val Arg Ser Phe Arg Gin Thr Met 
355 360 365 

30 Asp "^y^ Ser Thr Glu Pro Gly Arg Tyr Arg Phe Met Gly Thr Glu 

370 375 380 

Ala Tyr Ala Glu Ser He Asp Arg Thr Val Met Tyr Tyr Gly Leu Pro 
385 390 395 400 

Phe He Gin Glu Ala Asp Phe Pro Phe Asn Asn Tyr Leu Ser Met Leu 

405 410 415 

Asp Thr Val Ser Gly Asn Ser Val Tyr Glu Val He Thr Ser Trp Met 

420 425 430 

Glu Asn Met Pro Glu Gly Lys Trp Pro Asn Trp Met He Gly Gly Pro 
35 435 440 445 

Asp Ser Ser Arg Leu Thr Ser Arg Leu Gly Asn Gin Tyr Val Asn Val 

450 455 460 

Met Asn Met Leu Leu Phe Thr Leu Pro Gly Thr Pro He Thr Tyr Tyr 
465 470 475 480 
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20 



Gly Glu Glu 


He 


Gly 


Met 


Gly 


Asn 


He 


Val 


Ala 


Ala 


Asn 


Leu 


Asn 


Glu 










485 










490 










495 




Ser 


Tyr Asp 


He 


Asn 


Thr 


Leu 


Arg 


Ser 


Lys 


Ser 


Pro 


Met 


Gin 


Trp 


Asp 








500 










505 










510 






Asn 


Ser 


Ser 


Asn 


Ala 


Gly 


Phe 


Ser 


Glu 


Ala 


Ser 


Asn 


Thr 


Trp 


Leu 


Pro 






515 










520 










525 








Thr 


Asn 


Ser 


Asp 


Tyr 


His 


Thr 


Val 


Asn 


Val 


Asp 


Val 


Gin 


Lys 


Thr 


Gin 




530 










535 










540 










Pro 


Arg 


Ser 


Ala 


Leu 


Lys 


Leu 


Tyr 


Gin 


Asp 


Leu 


Ser 


Leu 


Leu 


His 


Ala 


545 








550 










555 










560 


Asn 


Glu 


Leu 


Leu 


Leu 


Asn 


Arg 


Gly 


Trp 


Phe 


Cys 


His 


Leu 


Arg 


Asn 


Asp 










565 










570 










575 




Ser 


His 


Tyr 


Val 


Val 


Tyr 


Thr 


Arg 


Glu 


Leu Asp 


Gly 


He 


Asp 


Arg 


He 








580 










585 










590 






Phe 


He 


Val 


Val 


Leu 


Asn 


Phe 


Gly 


Glu 


Ser 


Thr 


Leu 


Leu 


Asn 


Leu 


His 






595 










600 










605 








Asn 


Met 


He 


Ser 


Gly 


Leu 


Pro 


Ala 


Lys 


He 


Arg 


He 


Arg 


Leu 


Ser 


Thr 




610 










615 










620 










Asn 


Ser 


Ala 


Asp 


Lys 


Gly 


Ser 


Lys 


Val 


Asp 


Thr 


Ser 


Gly 


He 


Phe 


Leu 


625 










630 










635 










640 


Asp 


Lys 


Gly Glu 


Gly 


Leu 


He 


Phe 


Glu 


His 


Asn 


Thr 


Lys 


Asn 


Leu 


Leu 








645 










650 










655 




His 


Arg 


Gin 


Thr 


Ala 


Phe 


Arg 


Asp 


Arg 


Cys 


Phe 


Val 


Ser 


Asn 


Arg 


Ala 








660 










665 










670 






Cys 


Tyr 


Ser 


Ser 


Val 


Leu 


Asn 


He 


Leu 


Tyr 


Thr 


Ser 


Cys 








675 










680 










685 









(2) INFORMATION FOR SEQ ID NO: 182: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:182: 



Leu Val Pro Arg Gly Ser Pro Gly He Pro Gly Ser Arg Val Gly Gin 

15 10 15 

Cys Thr Asp Ser Asp Val Arg Arg Pro Trp Ala Arg Ser Cys Ala His 
20 25 30 

25 Gin Gly Cys Gly Ala Gly Thr Arg Asn Ser His Gly Cys He Thr Arg 
35 40 . 45 

Pro Leu Arg Gin Ala Ser 
50 

(2) INFORMATION FOR SEQ ID NO:183: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 amino acids 

30 (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183: 

Ser Ala Arg Asp Ser Gly Pro Ala Glu Asp Gly Ser Arg Ala Val Arg 
35 1 5 10 15 

Leu Asn Gly 



(2) INFORMATION FOR SEQ ID NO: 184: 
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WHAT IS CLAIMED IS ; 

1. A purified protein which * specif ically binds to 
a gastro-intestinal tract receptor selected from the group 

5 consisting of HPTl, hPEPTl, D2H, and hSI. 

2. A protein which binds specifically to a 
gastro-intestinal transport receptor selected from the group 
consisting of HPTl, hPEPTI, D2H, and hSI, in which the 

10 protein comprises an amino acid sequence selected from the 
group consisting of SEQ ID N0S:l-55 or a binding portion 
thereof . 

3. A protein which binds specifically to a 

15 gastro-intestinal transport receptor selected from the group 
consisting of HPTl, hPEPTI, D2H, and hSI, in which the amino 
acid sequence of the protein is selected from the group 
consisting of SEQ ID NOS:l-55, or a binding portion thereof. 

2 0 4. The protein of claim 2 which comprises the 

amino acid sequence substantially as set forth in: 
SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 22, SEQ ID NO: 23, SEQ 
ID NO: 30, SEQ ID NO: 43, SEQ ID NO: 46, or SEQ ID NO: 52, or 
a binding portion thereof. 

25 

5. The protein of claim 3, the amino acid 
sequence of which consists of the amino acid sequence 
substantially as set forth in: SEQ ID NO: 4, SEQ ID NO: 6, 
SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 30, SEQ ID NO: 43, 

30 SEQ ID NO: 46, or SEQ ID NO: 52, or a binding portion 
thereof . 

6. A protein of not more than 50 amino acids in 
length which specifically binds to a gastro-intestinal 

35 transport receptor selected from the group consisting of 
HPTl, hPEPTI, D2H, and hSI, in which the protein includes, 
positioned anywhere along its sequence, the contiguous amino 
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acid sequence of: Xaai Thr Xaa2 Xaa3 Ser Xaa4 Xaag Xaa^ Asn 
Xaa^ Arg (SEQ ID NO:253), where Xaa^ is Ser or Thr; Xaa2 is 
Arg or Lys; Xaa3 is Lys or Arg; Xaa^ is Ser or Leu; Xaaj is 
Arg, lie, Val, or Ser; Xaa^ is Ser, Tyr, Phe, or His; and Xaa^ 
5 is Pro, His or Arg. 

7. The protein of claim 6 which is not more than 
4 0 amino acids in length. 

10 8. The protein of claim 6 which is not more than 

3 0 amino acids in length. 

9. The protein of claim 6 which is not more than 
20 amino acids in length. 

15 

10. A protein of not more than 50 amino acids in 
length which specifically binds to a gastro-intestinal 
transport receptor selected from the group consisting of 
HPTl, hPEPTl, D2H, and hSI, in which the protein includes, 

20 positioned anywhere along its sequence, the contiguous amino 
acid sequence of: Asp Xaa^^ Asp Xaa2 Arg Arg Xaa3 Xaa4 (SEQ ID 
NO: 254) where Xaa^ is Ser, Ala, or Gly; Xaa2 is Val or Gin; 
Xaag is Pro, Gly, or Ser; and Xaa4 is Trp or Tyr. 

25 11. The protein of claim 10 which is not more than 

4 0 amino acids in length. 

12. The protein of claim 10 which is not more than 
3 0 amino acids in length. 

30 

13. The protein of claim 10 which is not more than 
2 0 amino acids in length. 

14. A protein of not more than 50 amino acids in 
35 length which specifically binds to a gastro-intestinal 

transport receptor selected from the group consisting of 
HPTl, hPEPTl, D2H, and hSI, in which the protein includes. 



- 235 - 



wo 98/51325 PCT/US98/10088 



positioned anywhere along its sequence, the contiguous amino 
acid sequence of: Val Arg Ser Gly Cys Gly Xaa^ Xaaj Ser Ser 
(SEQ ID NO:255); where Xaa-^ is Ala or Phe; and Xaaj is Arg or 
His, 

5 

15. The protein of claim 14 which is not more than 
40 amino acids in length. 

16. The protein of claim 14 which is not more than 
10 30 amino acids in length. 

17 . The protein of claim 14 which is not more than 
20 amino acids in length. 

15 18. A protein of not more than 50 amino acids in 

length which specifically binds to a gastro-intestinal 
transport receptor selected from the group consisting of 
HPTl, hPEPTl, D2H, and hSI , in which the protein includes, 
positioned anywhere along its sequence, the contiguous amino 
20 acid sequence of: NTRKSSRSNPR (SEQ ID NO: 256) or STKRSLIYNHR 
(SEQ ID NO: 257) or STGRKVFNRR (SEQ ID NO: 258) or TNAKHSSHNRR 
(SEQ ID N0:259) . 

19. A protein of not more than 50 amino acids in 
25 length which specifically binds to a gastro-intestinal 

transport receptor selected from the group consisting of 
HPTl, hPEPTl, D2H, and hSI, in which the protein includes, 
positioned anywhere along its sequence, the contiguous amino 
acid sequence of: DSDVRRPW (SEQ ID NO: 260) or AADQRRGW (SEQ 
30 ID NO:261) or DGRGGRSY (SEQ ID NO:262). 

20. A protein of not more than 50 amino acids in 
length which specifically binds to a gastro-intestinal 
transport receptor selected from the group consisting of 

3 5 HPTl, hPEPTI, D2H, and hSI, in which the protein includes, 
positioned anywhere along its sequence, the contiguous amino 
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acid sequence of: RVRS (SEQ ID NO: 263) or SVRSGCGFRGSS (SEQ 
ID N0:264) or SVRGGCGAHSS (SEQ ID NO:265). 

21. The protein of claim 1, 2, 3, 6, 10, 14, 18, 
5 19, or 20 which is purified. 

22. A composition comprising the protein of claim 
1, 2, 3, S, 10, 14, 18, 19, or 20, bound to a material 
comprising an active agent, said active agent being of value 

10 in the treatment of a mammalian disease or disorder. 

23. The composition of claim 22 in which the 
active agent is a drug. 

15 24. The composition of claim 22 in which the 

material is a particle containing the active agent. 

25. The composition of claim 22 in which the 
material is a slow-release device containing the drug. 

20 

26. The composition of claim 22 in which the 
protein is covalently or noncovalently bound to the material. 

27. A composition comprising a chimeric protein 
25 bound to a material comprising an active agent, in which the 

chimeric protein comprises a sequence selected from the group 
consisting of SEQ ID NOS:l-55 or a binding portion thereof 
fused via a covalent bond to an amino acid sequence of a 
second protein, in which the active agent is of value in the 
30 treatment of a mammalian disease or disorder. 

28. A composition comprising the protein of claim 
1, 2, 3, 6, 10, 14, 18, 19, or 20 covalently bound to a 
particle containing a drug. 

35 

29. A composition comprising the protein of claim 
1, 2, 3, 6, 10, 14, 18, 19, or 20 covalently bound to a drug. 



